8.4.3 Hypothesis Testing for the Mean

Here, we would like to discuss some common hypothesis testing problems. We assume that we
have a random sample $X_1, X_2, \dots, X_n$ from a distribution and our goal is to make inference
about the mean of the distribution, $\mu$. We consider three hypothesis testing problems. The
first one is a test to decide between the following hypotheses:

$$\begin{aligned}
H_0 &: \mu = \mu_0, \\
H_1 &: \mu \neq \mu_0.
\end{aligned}$$

In this case, the null hypothesis is a simple hypothesis and the alternative hypothesis is a
two-sided hypothesis (i.e., it includes both $\mu < \mu_0$ and $\mu > \mu_0$). We call this
hypothesis test a two-sided test. The second and the third cases are one-sided tests. More
specifically, the second case is

$$\begin{aligned}
H_0 &: \mu \leq \mu_0, \\
H_1 &: \mu > \mu_0.
\end{aligned}$$

Here, both $H_0$ and $H_1$ are one-sided, so we call this test a one-sided test. The third case is
very similar to the second case. More specifically, the third scenario is

$$\begin{aligned}
H_0 &: \mu \geq \mu_0, \\
H_1 &: \mu < \mu_0.
\end{aligned}$$

In all of the three cases, we use the sample mean

$$\overline{X} = \frac{X_1 + X_2 + \cdots + X_n}{n}$$

to define our statistic. In particular, if we know the variance of the $X_i$'s,
$\mathrm{Var}(X_i) = \sigma^2$, then we define our test statistic as the normalized sample mean
(assuming $H_0$):

$$W(X_1, X_2, \dots, X_n) = \frac{\overline{X} - \mu_0}{\sigma/\sqrt{n}}.$$

If we do not know the variance of the $X_i$'s, we use

$$W(X_1, X_2, \dots, X_n) = \frac{\overline{X} - \mu_0}{S/\sqrt{n}},$$

where $S$ is the sample standard deviation,

$$S = \sqrt{\frac{1}{n-1} \sum_{k=1}^{n} (X_k - \overline{X})^2}.$$

In any case, we will be able to find the distribution of $W$, and thus we can design our tests by
calculating error probabilities. Let us start with the first case.
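As a minimal sketch of the two definitions of the test statistic, the following Python snippet computes $W$ with either a known $\sigma$ or the sample standard deviation $S$. The function name `w_statistic` and the toy data are assumptions made only for illustration; they do not come from the text.

```python
import math
import statistics


def w_statistic(sample, mu0, sigma=None):
    """Return W = (sample mean - mu0) / (scale / sqrt(n)).

    If sigma is given, it is treated as the known standard deviation;
    otherwise the sample standard deviation S (n - 1 denominator) is used.
    """
    n = len(sample)
    xbar = statistics.fmean(sample)
    scale = sigma if sigma is not None else statistics.stdev(sample)
    return (xbar - mu0) / (scale / math.sqrt(n))


# Toy data, chosen only for illustration.
data = [1.0, 2.0, 3.0, 4.0, 5.0]
w_known = w_statistic(data, mu0=2.0, sigma=2.0)  # sigma assumed known
w_unknown = w_statistic(data, mu0=2.0)           # sigma replaced by S
print(w_known, w_unknown)
```

The only difference between the two calls is the scale in the denominator, which mirrors the two formulas for $W$.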

Two-sided Tests for the Mean: 

Here, we are given a random sample $X_1, X_2, \dots, X_n$ from a distribution. Let $\mu = EX_i$.
Our goal is to decide between

$$\begin{aligned}
H_0 &: \mu = \mu_0, \\
H_1 &: \mu \neq \mu_0.
\end{aligned}$$

Example 8.22, which we saw previously, is an instance of this case. If $H_0$ is true, we expect
$\overline{X}$ to be close to $\mu_0$, and so we expect $W(X_1, X_2, \dots, X_n)$ to be close to 0
(see the definition of $W$ above).

Therefore, we can suggest the following test. Choose a threshold, and call it $c$. If
$|W| \leq c$, accept $H_0$, and if $|W| > c$, accept $H_1$. How do we choose $c$? If $\alpha$ is the
required significance level, we must have

$$\begin{aligned}
P(\text{type I error}) &= P(\text{Reject } H_0 \mid H_0) \\
&= P(|W| > c \mid H_0) \leq \alpha.
\end{aligned}$$

Thus, we can choose $c$ such that $P(|W| > c \mid H_0) = \alpha$. Let us look at an example.


Example 8.24 

Let $X_1, X_2, \dots, X_n$ be a random sample from a $N(\mu, \sigma^2)$ distribution, where $\mu$ is
unknown but $\sigma$ is known. Design a level $\alpha$ test to choose between

$$\begin{aligned}
H_0 &: \mu = \mu_0, \\
H_1 &: \mu \neq \mu_0.
\end{aligned}$$


• Solution 

As discussed above, we let

$$W(X_1, X_2, \dots, X_n) = \frac{\overline{X} - \mu_0}{\sigma/\sqrt{n}}.$$

Note that, assuming $H_0$, $W \sim N(0, 1)$. We will choose a threshold, $c$. If $|W| \leq c$,
we accept $H_0$, and if $|W| > c$, we accept $H_1$. To choose $c$, we let

$$P(|W| > c \mid H_0) = \alpha.$$

Since the standard normal PDF is symmetric around 0, we have

$$P(|W| > c \mid H_0) = 2 P(W > c \mid H_0).$$

Thus, we conclude $P(W > c \mid H_0) = \frac{\alpha}{2}$. Therefore,

$$c = z_{\frac{\alpha}{2}}.$$

Therefore, we accept $H_0$ if

$$\left| \frac{\overline{X} - \mu_0}{\sigma/\sqrt{n}} \right| \leq z_{\frac{\alpha}{2}},$$

and reject it otherwise.
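The test of Example 8.24 can be sketched in a few lines of Python. Here $z_{\frac{\alpha}{2}}$ is taken to be the point with $P(Z > z_{\frac{\alpha}{2}}) = \frac{\alpha}{2}$ for a standard normal $Z$, which is the convention implied by the derivation above. The function name `two_sided_z_test` and the numbers passed to it are hypothetical.

```python
import math
from statistics import NormalDist


def two_sided_z_test(xbar, mu0, sigma, n, alpha):
    """Return (W, c, accept_h0) for H0: mu = mu0 vs H1: mu != mu0, sigma known."""
    w = (xbar - mu0) / (sigma / math.sqrt(n))
    c = NormalDist().inv_cdf(1 - alpha / 2)  # c = z_{alpha/2}
    return w, c, abs(w) <= c


# Hypothetical numbers, chosen only for illustration.
w, c, accept = two_sided_z_test(xbar=10.5, mu0=10.0, sigma=2.0, n=16, alpha=0.05)
print(w, c, accept)
```

With these inputs $W = 1$, which lies inside the acceptance region, so $H_0$ is accepted at level 0.05.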
Relation to Confidence Intervals: It is interesting to examine the above acceptance region.
Here, we accept $H_0$ if

$$\left| \frac{\overline{X} - \mu_0}{\sigma/\sqrt{n}} \right| \leq z_{\frac{\alpha}{2}}.$$

We can rewrite the above condition as

$$\mu_0 \in \left[ \overline{X} - z_{\frac{\alpha}{2}} \frac{\sigma}{\sqrt{n}}, \; \overline{X} + z_{\frac{\alpha}{2}} \frac{\sigma}{\sqrt{n}} \right].$$

The above interval should look familiar to you. It is the $(1 - \alpha)100\%$ confidence interval
for $\mu$. This is not a coincidence, as there is a general relationship between confidence
interval problems and hypothesis testing problems.
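The equivalence between the acceptance region and the confidence interval can be checked numerically. The sketch below assumes known $\sigma$ and uses hypothetical values for $\overline{X}$, $\sigma$, $n$, and the candidate values of $\mu_0$; the helper names are illustrative only.

```python
import math
from statistics import NormalDist


def confidence_interval(xbar, sigma, n, alpha):
    """(1 - alpha) confidence interval for the mean when sigma is known."""
    half = NormalDist().inv_cdf(1 - alpha / 2) * sigma / math.sqrt(n)
    return xbar - half, xbar + half


def accepts_h0(xbar, mu0, sigma, n, alpha):
    """Two-sided z-test decision: True when |W| <= z_{alpha/2}."""
    w = (xbar - mu0) / (sigma / math.sqrt(n))
    return abs(w) <= NormalDist().inv_cdf(1 - alpha / 2)


# Hypothetical numbers, chosen only for illustration.
xbar, sigma, n, alpha = 10.5, 2.0, 16, 0.05
low, high = confidence_interval(xbar, sigma, n, alpha)
candidates = [9.0, 9.5, 10.0, 10.5, 11.0, 11.5, 12.0]
agree = all(
    accepts_h0(xbar, mu0, sigma, n, alpha) == (low <= mu0 <= high)
    for mu0 in candidates
)
print((low, high), agree)
```

For every candidate $\mu_0$, the test accepts $H_0$ exactly when $\mu_0$ falls inside the interval.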


Example 8.25 

For the above example (Example 8.24), find $\beta$, the probability of type II error, as a
function of $\mu$.


• Solution 

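We have

$$\begin{aligned}
\beta(\mu) &= P(\text{type II error}) = P(\text{accept } H_0 \mid \mu) \\
&= P\left( \left| \frac{\overline{X} - \mu_0}{\sigma/\sqrt{n}} \right| \leq z_{\frac{\alpha}{2}} \;\middle|\; \mu \right) \\
&= P\left( \mu_0 - z_{\frac{\alpha}{2}} \frac{\sigma}{\sqrt{n}} \leq \overline{X} \leq \mu_0 + z_{\frac{\alpha}{2}} \frac{\sigma}{\sqrt{n}} \;\middle|\; \mu \right).
\end{aligned}$$

Given $\mu$, we have $\overline{X} \sim N\left(\mu, \frac{\sigma^2}{n}\right)$, so

$$\beta(\mu) = \Phi\left( z_{\frac{\alpha}{2}} + \frac{\mu_0 - \mu}{\sigma/\sqrt{n}} \right) - \Phi\left( -z_{\frac{\alpha}{2}} + \frac{\mu_0 - \mu}{\sigma/\sqrt{n}} \right).$$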
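To get a feel for $\beta(\mu)$, the expression in terms of $\Phi$ can be evaluated directly. This is a sketch under the assumptions of Example 8.24 (normal sample, known $\sigma$); the function name `type_two_error` and the parameter values are hypothetical.

```python
import math
from statistics import NormalDist


def type_two_error(mu, mu0, sigma, n, alpha):
    """beta(mu) = Phi(z + d) - Phi(-z + d), with d = (mu0 - mu) / (sigma / sqrt(n))."""
    std_normal = NormalDist()
    z = std_normal.inv_cdf(1 - alpha / 2)  # z_{alpha/2}
    shift = (mu0 - mu) / (sigma / math.sqrt(n))
    return std_normal.cdf(z + shift) - std_normal.cdf(-z + shift)


# Hypothetical settings, chosen only for illustration.
mu0, sigma, n, alpha = 170.0, 8.0, 9, 0.05
for mu in (170.0, 172.0, 175.0, 180.0):
    print(mu, type_two_error(mu, mu0, sigma, n, alpha))
```

At $\mu = \mu_0$ the function returns $1 - \alpha$, and it decreases as $\mu$ moves away from $\mu_0$ in either direction, so alternatives far from $\mu_0$ are the easiest to detect.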


Unknown variance: The above results (Example 8.25) can be extended to the case when we do
not know the variance using the $t$-distribution. More specifically, consider the following
example.


Example 8.26 

Let $X_1, X_2, \dots, X_n$ be a random sample from a $N(\mu, \sigma^2)$ distribution, where $\mu$ and
$\sigma$ are unknown. Design a level $\alpha$ test to choose between

$$\begin{aligned}
H_0 &: \mu = \mu_0, \\
H_1 &: \mu \neq \mu_0.
\end{aligned}$$


• Solution 

Let $S^2$ be the sample variance for this random sample. Then, assuming $H_0$, the random
variable $W$ defined as

$$W(X_1, X_2, \dots, X_n) = \frac{\overline{X} - \mu_0}{S/\sqrt{n}}$$

has a $t$-distribution with $n - 1$ degrees of freedom, i.e., $W \sim T(n - 1)$. Thus, we
can repeat the analysis of Example 8.24 here. The only difference is that we need
to replace $\sigma$ by $S$ and $z_{\frac{\alpha}{2}}$ by $t_{\frac{\alpha}{2}, n-1}$. Therefore, we
accept $H_0$ if

$$|W| \leq t_{\frac{\alpha}{2}, n-1},$$

and reject it otherwise. Let us look at a numerical example of this case.


Example 8.27 

The average adult male height in a certain country is 170 cm. We suspect that the men in a 
certain city in that country might have a different average height due to some environmental 
factors. We pick a random sample of size 9 from the adult males in the city and obtain the 
following values for their heights (in cm): 

176.2 157.9 160.1 180.9 165.1 167.2 162.9 155.7 166.2 

Assume that the heights in this population are normally distributed. Here, we need to
decide between

$$\begin{aligned}
H_0 &: \mu = 170, \\
H_1 &: \mu \neq 170.
\end{aligned}$$

Based on the observed data, is there enough evidence to reject $H_0$ at significance level
$\alpha = 0.05$?


• Solution 

Let's first compute the sample mean and the sample standard deviation. The
sample mean is

$$\overline{X} = \frac{X_1 + X_2 + X_3 + X_4 + X_5 + X_6 + X_7 + X_8 + X_9}{9} = 165.8.$$

The sample variance is given by

$$S^2 = \frac{1}{9 - 1} \sum_{k=1}^{9} (X_k - \overline{X})^2 = 68.01.$$

The sample standard deviation is given by

$$S = \sqrt{S^2} = 8.25.$$

The following MATLAB code can be used to obtain these values: 
    x = [176.2, 157.9, 160.1, 180.9, 165.1, 167.2, 162.9, 155.7, 166.2];
    m = mean(x);   % sample mean
    v = var(x);    % sample variance (normalized by n-1 by default)
    s = std(x);    % sample standard deviation

Now, our test statistic is

$$W(X_1, X_2, \dots, X_9) = \frac{\overline{X} - \mu_0}{S/\sqrt{n}} = \frac{165.8 - 170}{8.25/3} \approx -1.53.$$

Thus, $|W| \approx 1.53$. Also, we have

$$t_{\frac{\alpha}{2}, n-1} = t_{0.025, 8} \approx 2.31.$$

The above value can be obtained in MATLAB using the command tinv(0.975, 8).
Thus, we conclude

$$|W| \leq t_{\frac{\alpha}{2}, n-1}.$$

Therefore, we accept $H_0$. In other words, we do not have enough evidence to
conclude that the average height in the city is different from the average height in
the country.
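For readers without MATLAB, the same computation can be sketched in Python with the standard library. The standard library has no $t$ quantile function, so the critical value 2.31 is simply copied from the example rather than computed; the variable names are illustrative.

```python
import math
import statistics

heights = [176.2, 157.9, 160.1, 180.9, 165.1, 167.2, 162.9, 155.7, 166.2]
mu0 = 170.0
t_crit = 2.31  # t_{0.025, 8}, value quoted in the example (not computed here)

n = len(heights)
m = statistics.fmean(heights)     # sample mean
v = statistics.variance(heights)  # sample variance, n - 1 denominator
s = statistics.stdev(heights)     # sample standard deviation
w = (m - mu0) / (s / math.sqrt(n))
accept_h0 = abs(w) <= t_crit
print(m, v, s, w, accept_h0)
```

The decision does not depend on how $S$ is rounded, since $|W|$ is well below the critical value.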


What if the sample is not from a normal distribution? In the case that $n$ is large, we can say
that

$$W(X_1, X_2, \dots, X_n) = \frac{\overline{X} - \mu_0}{S/\sqrt{n}}$$

is approximately standard normal. Therefore, we accept $H_0: \mu = \mu_0$ if

$$\left| \frac{\overline{X} - \mu_0}{S/\sqrt{n}} \right| \leq z_{\frac{\alpha}{2}},$$

and reject it otherwise (i.e., accept $H_1: \mu \neq \mu_0$).
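A short sketch of the large-sample version follows. The 0/1 data set is hypothetical and was picked only because it is clearly non-normal; whether $n = 100$ is "large enough" for the normal approximation is an assumption of this illustration, not a claim from the text.

```python
import math
import statistics
from statistics import NormalDist


def large_sample_two_sided_test(sample, mu0, alpha):
    """Return (W, accept_h0) using S in place of sigma and a normal threshold."""
    n = len(sample)
    xbar = statistics.fmean(sample)
    s = statistics.stdev(sample)
    w = (xbar - mu0) / (s / math.sqrt(n))
    c = NormalDist().inv_cdf(1 - alpha / 2)  # z_{alpha/2}
    return w, abs(w) <= c


# Hypothetical 0/1 data: 60 ones and 40 zeros.
sample = [1] * 60 + [0] * 40
w, accept = large_sample_two_sided_test(sample, mu0=0.5, alpha=0.05)
print(w, accept)
```

Here $W$ is slightly above $z_{0.025} \approx 1.96$, so $H_0: \mu = 0.5$ is rejected at level 0.05.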
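Let us summarize what we have obtained for the two-sided test for the mean.

Table 8.2: Two-sided hypothesis testing for the mean: $H_0: \mu = \mu_0$, $H_1: \mu \neq \mu_0$.

| Case | Test Statistic | Acceptance Region |
|---|---|---|
| $X_i \sim N(\mu, \sigma^2)$, $\sigma$ known | $W = \frac{\overline{X} - \mu_0}{\sigma/\sqrt{n}}$ | $\lvert W \rvert \leq z_{\frac{\alpha}{2}}$ |
| $n$ large, $X_i$ non-normal | $W = \frac{\overline{X} - \mu_0}{S/\sqrt{n}}$ | $\lvert W \rvert \leq z_{\frac{\alpha}{2}}$ |
| $X_i \sim N(\mu, \sigma^2)$, $\sigma$ unknown | $W = \frac{\overline{X} - \mu_0}{S/\sqrt{n}}$ | $\lvert W \rvert \leq t_{\frac{\alpha}{2}, n-1}$ |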


One-sided Tests for the Mean: 

We can provide a similar analysis when we have a one-sided test. Let's show this by an 
example. 


Example 8.28 

Let $X_1, X_2, \dots, X_n$ be a random sample from a $N(\mu, \sigma^2)$ distribution, where $\mu$ is
unknown and $\sigma$ is known. Design a level $\alpha$ test to choose between

$$\begin{aligned}
H_0 &: \mu \leq \mu_0, \\
H_1 &: \mu > \mu_0.
\end{aligned}$$


• Solution 

As before, we define the test statistic as

$$W(X_1, X_2, \dots, X_n) = \frac{\overline{X} - \mu_0}{\sigma/\sqrt{n}}.$$

If $H_0$ is true (i.e., $\mu \leq \mu_0$), we expect $\overline{X}$ (and thus $W$) to be relatively
small, while if $H_1$ is true, we expect $\overline{X}$ (and thus $W$) to be larger. This suggests
the following test: Choose a threshold, and call it $c$. If $W \leq c$, accept $H_0$, and if
$W > c$, accept $H_1$. How do we choose $c$? If $\alpha$ is the required significance level, we
must have

$$\begin{aligned}
P(\text{type I error}) &= P(\text{Reject } H_0 \mid H_0) \\
&= P(W > c \mid \mu \leq \mu_0) \leq \alpha.
\end{aligned}$$

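Here, the probability of type I error depends on $\mu$. More specifically, for any
$\mu \leq \mu_0$, we can write

$$\begin{aligned}
P(\text{type I error} \mid \mu) &= P(\text{Reject } H_0 \mid \mu) \\
&= P(W > c \mid \mu) \\
&= P\left( \frac{\overline{X} - \mu_0}{\sigma/\sqrt{n}} > c \;\middle|\; \mu \right) \\
&= P\left( \frac{\overline{X} - \mu}{\sigma/\sqrt{n}} + \frac{\mu - \mu_0}{\sigma/\sqrt{n}} > c \;\middle|\; \mu \right) \\
&= P\left( \frac{\overline{X} - \mu}{\sigma/\sqrt{n}} > c + \frac{\mu_0 - \mu}{\sigma/\sqrt{n}} \;\middle|\; \mu \right) \\
&\leq P\left( \frac{\overline{X} - \mu}{\sigma/\sqrt{n}} > c \;\middle|\; \mu \right) \quad (\text{since } \mu \leq \mu_0) \\
&= 1 - \Phi(c) \quad \left( \text{since given } \mu, \; \frac{\overline{X} - \mu}{\sigma/\sqrt{n}} \sim N(0, 1) \right).
\end{aligned}$$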


Thus, we can choose $\alpha = 1 - \Phi(c)$, which results in

$$c = z_{\alpha}.$$

Therefore, we accept $H_0$ if

$$\frac{\overline{X} - \mu_0}{\sigma/\sqrt{n}} \leq z_{\alpha},$$

and reject it otherwise.
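The one-sided test of Example 8.28 differs from the two-sided one only in the threshold ($z_{\alpha}$ instead of $z_{\frac{\alpha}{2}}$) and in dropping the absolute value. A minimal sketch, with a hypothetical function name and made-up inputs:

```python
import math
from statistics import NormalDist


def upper_tailed_z_test(xbar, mu0, sigma, n, alpha):
    """Return (W, c, accept_h0) for H0: mu <= mu0 vs H1: mu > mu0, sigma known."""
    w = (xbar - mu0) / (sigma / math.sqrt(n))
    c = NormalDist().inv_cdf(1 - alpha)  # c = z_alpha
    return w, c, w <= c


# Hypothetical numbers, chosen only for illustration.
w, c, accept = upper_tailed_z_test(xbar=11.0, mu0=10.0, sigma=2.0, n=16, alpha=0.05)
print(w, c, accept)
```

Here $W = 2$ exceeds $z_{0.05} \approx 1.645$, so $H_0$ is rejected. A sample mean below $\mu_0$ always gives $W < 0$ and therefore never leads to rejection in this test.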


The above analysis can be repeated for other cases. More generally, suppose that we are given
a random sample $X_1, X_2, \dots, X_n$ from a distribution. Let $\mu = EX_i$. Our goal is to decide
between

$$\begin{aligned}
H_0 &: \mu \leq \mu_0, \\
H_1 &: \mu > \mu_0.
\end{aligned}$$

We define the test statistic as before, i.e., we define $W$ as

$$W(X_1, X_2, \dots, X_n) = \frac{\overline{X} - \mu_0}{\sigma/\sqrt{n}}$$

if $\sigma = \sqrt{\mathrm{Var}(X_i)}$ is known, and as

$$W(X_1, X_2, \dots, X_n) = \frac{\overline{X} - \mu_0}{S/\sqrt{n}}$$

if $\sigma$ is unknown. If $H_0$ is true (i.e., $\mu \leq \mu_0$), we expect $\overline{X}$ (and thus
$W$) to be relatively small, while if $H_1$ is true, we expect $\overline{X}$ (and thus $W$) to be
larger. This suggests the following test: Choose a threshold $c$. If $W \leq c$, accept $H_0$,
and if $W > c$, accept $H_1$. To choose $c$, note that

$$\begin{aligned}
P(\text{type I error}) &= P(\text{Reject } H_0 \mid H_0) \\
&= P(W > c \mid \mu \leq \mu_0) \\
&\leq P(W > c \mid \mu = \mu_0).
\end{aligned}$$
Note that the last inequality holds because if we make $\mu$ larger, the probability of $W > c$
can only increase. In other words, we assumed the worst case scenario, i.e., $\mu = \mu_0$, for the
probability of error. Thus, we can choose $c$ such that $P(W > c \mid \mu = \mu_0) = \alpha$. By
following this procedure, we obtain the acceptance regions reflected in Table 8.3.

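Table 8.3: One-sided hypothesis testing for the mean: $H_0: \mu \leq \mu_0$, $H_1: \mu > \mu_0$.

| Case | Test Statistic | Acceptance Region |
|---|---|---|
| $X_i \sim N(\mu, \sigma^2)$, $\sigma$ known | $W = \frac{\overline{X} - \mu_0}{\sigma/\sqrt{n}}$ | $W \leq z_{\alpha}$ |
| $n$ large, $X_i$ non-normal | $W = \frac{\overline{X} - \mu_0}{S/\sqrt{n}}$ | $W \leq z_{\alpha}$ |
| $X_i \sim N(\mu, \sigma^2)$, $\sigma$ unknown | $W = \frac{\overline{X} - \mu_0}{S/\sqrt{n}}$ | $W \leq t_{\alpha, n-1}$ |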
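The worst-case argument behind the one-sided threshold can be checked numerically in the normal, known-variance case, where $P(W > c \mid \mu) = 1 - \Phi\left(c + \frac{\mu_0 - \mu}{\sigma/\sqrt{n}}\right)$. The sketch below uses hypothetical parameter values and a hypothetical function name.

```python
import math
from statistics import NormalDist


def type_one_error(mu, mu0, sigma, n, c):
    """P(W > c | mu) for W = (Xbar - mu0) / (sigma / sqrt(n)), X_i ~ N(mu, sigma^2)."""
    shift = (mu0 - mu) / (sigma / math.sqrt(n))
    return 1 - NormalDist().cdf(c + shift)


# Hypothetical settings, chosen only for illustration.
mu0, sigma, n, alpha = 10.0, 2.0, 16, 0.05
c = NormalDist().inv_cdf(1 - alpha)  # c = z_alpha
errors = {mu: type_one_error(mu, mu0, sigma, n, c) for mu in (9.0, 9.5, 9.9, 10.0)}
print(errors)
```

The error probability increases with $\mu$ over the null region and reaches $\alpha$ exactly at $\mu = \mu_0$, which is why choosing $c$ at $\mu = \mu_0$ controls the type I error for every $\mu \leq \mu_0$.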



Note that the tests mentioned in Table 8.3 remain valid if we replace the null hypothesis by
$H_0: \mu = \mu_0$. The reason for this is that in choosing the threshold $c$, we assumed the
worst case scenario, i.e., $\mu = \mu_0$. Finally, if we need to decide between

$$\begin{aligned}
H_0 &: \mu \geq \mu_0, \\
H_1 &: \mu < \mu_0,
\end{aligned}$$

we can again repeat the above analysis and we obtain the acceptance regions reflected in Table
8.4.


Table 8.4: One-sided hypothesis testing for the mean: $H_0: \mu \geq \mu_0$, $H_1: \mu < \mu_0$.

| Case | Test Statistic | Acceptance Region |
|---|---|---|
| $X_i \sim N(\mu, \sigma^2)$, $\sigma$ known | $W = \frac{\overline{X} - \mu_0}{\sigma/\sqrt{n}}$ | $W \geq -z_{\alpha}$ |